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POTYVIRUS COAT PROTEIN GENES AND PLANTS TRANSFORMED THEREWITH 

FIELD OF INVENTION 
The present invention relates to the coat protein genes of 
potyviruses. More specifically the invention relates to a process for 
5 preparing a coat protein gene from a potyvirus as well as its 
incorporation into a transfer vector, and its use in producing 
transformed plant cells and transformed plants which are resistant to 
viral infections by the particular potyvirus and related viruses from 
which the gene is derived. 
10 BACKGROUND OF THE INVENTION 

Potyviruses are a distinct group of plant viruses which are 
pathogenic to various crops. Potyviruses include watermelon mosaic 
virus II (WMVII); papaya ringspot virus strains papaya ringspot and 
watermelon mosaic I (PRV-p and PRV-w) , two closely related members of 
15 the plant potyvirus group which were at one time classified as 
distinct virus types, but are presently classified as different 
strains of the same virus; zucchini yellow mosaic virus (ZYMV) ; and 
many others. These viruses consist of f lexous , filamentous particles 
of dimensions approximately 780 X 12 nanometers. The viral particles 
20 contain a single -stranded RNA genome containing about 10,000 
nucleotides of positive (+, coding, or sense) polarity. Translation 
of the RNA genome of potyviruses shows that the RNA encodes a single 
large polyprotein of about 330 TsD. This polyprotein contains several 
proteins, one of which is a 49kD protease that .is specific for the 
25 cleavage of the polyprotein into at least six (6) other peptides. 
One of the proteins contained within this polyprotein is a 35kD 
capsid or coat protein which coats and protects the viral RNA from 
degradation. 

The genome organization of several viruses belonging to the 
30 potyvirus family group has been studied in detail, in particular 
tobacco etch virus, tobacco vein mottling virus and 'pepper mottle 
virus. In each case, the location of the coat protein gene has been 
at the 3' -end of the RNA, just prior to a stretch of (200 to 300 
bases) terminal adenine nucleotides residues. The location of the 49 
35 kD protease gene appears to be conserved in these viruses. In the 
tobacco etch virus, the protease cleavage site has been determined to 
be the dipeptide Gln-Ser, Gln-Gly or Gin- Ala. Conservation of these 
dipeptides as the cleavage sites in these viral polyproteins is 
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apparent from the sequences of the above-listed potyviruses. 

Expression of the coat protein genes from tobacco mosaic virus, 
alfalfa mosaic virus , cucumber mosaic virus , and potato virus X in 
transgenic plants has resulted in plants which are resistant to 
5 infection by the respective virus . In order to produce such 
transgenic plants , the coat protein gene must be inserted into the 
genome of the plant. Furthermore, the coat protein gene must contain 
all the genetic control sequences necessary for the expression of the 
gene after it has been incorporated into the plant genome. 

10 Since the coat protein of a potyvirus is produced by the post 

translational processing of a polyprotein, the coat protein gene 
isolated from viral RNA does not contain the genetic regulatory 
sequences needed for gene expression. The coat protein gene does not 
contain the transcription and translation signals necessary for its 

15 expression once transferred and integrated into a plant genome. It 
must, therefore, be engineered to contain a plant expressible 
promoter, a translation initiation codon (ATG) and a plant functional 
poly (A) addition signal (AATAAA) 3' of its translation termination 
codon. 

20 In the present invention, the nucleotide sequences of the coat 

protein genes for WMV-II, PRV-p and ZYMV have been determined, and 
the genes have been Inserted into expression vectors to supply them 
with the necessary genetic regulatory sequences so that the genes can 
be expressed when incorporated into a plant genome. Plant cells are 

25 transformed with the vector construct and the plant cells are induced 
to regenerate. The resulting plants contain the coat protein genes 
and produce the coat protein. The production of the protein confers 
upon the plant an increased resistance to infection by the virus from 
which the coat protein gene was derived. 

30 INFORMATION DISCLOSURE 

European patent application EP 0 223 452 describes plants that 
are resistant to viral diseases and methods for producing them. The 
process described comprises the steps of transforming a plant with a 
DNA insert comprising a promoter, a DNA sequence derived from the 

35 virus, and a poly (A) addition sequence. 

PCT patent application PCT/US86/00514 refers generally to a 
method of conferring resistance to a parasite to a host of the 
parasite . 
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Allison et al. (1985) "Biochemical Analysis of the Capsid 
Protein Gene and Capsid Protein of Tobacco Etch Virus: N-Terminal 
Amino Acids Are Located on the Virion's Surface", Virology 
147:309-316, describe the nucleotide sequence at the 3' end of the 
5 tobacco etch virus genome encoding the capsid protein. Homology to 
the sequence encoding the capsid protein of Pepper mottle virus is 
reported. 

Allison et al. (1986) "The Nucleotide Sequence of the Coding 
Region of Tobacco Etch Virus Genomic RNA: Evidence for the Synthesis 

10 of a Single Polyprotein" , Virology 154:9-20 describe the genome 
organization of the tobacco etch virus. 

Carrington, J.C. and Dougherty, W.G. (1987) "Small nuclear 
inclusion protein encoded by a plant potyvirus genome is a protease" , 
J. Virology 61:2540-2548, disclose that the viral RNA of tobacco etch 

15 virus encodes the 49K protease responsible for cleavage of the 
polyprotein produced when the viral RNA is translated. 

Dodds et al. (1985) "Cross Protection between strains of 
cucumber mosaic virus: effect of host and type of inoculum on 
accumulation of virions and double -stranded RNA of the challenge 

20 strain", Virology 144:301-309, describe increased resistance to 
challenge by virus conferred to a plant by infection of a different 
strain of virus. 

Dougherty, W.G. et al. (1985) "Nucleotide Sequence at the 3' 
Terminus of Pepper Mottle Virus Genomic RNA: Evidence for an Alterna- 
25 tive Mode of Potyvirus Capsid Protein Gene Organization" , Virology 
146:282-291, report the nucleotide sequence of the 3' terminus of the 
viral RNA genome of pepper mottle virus. 

Dougherty, W.G. et al. (1988) "Biochemical and mutational 
analysis of plant virus polyprotein cleavage site", EMBO J. 
30 7:1281-1287, describe the conservation of the proteolytic cleavage 
site among geographically distinct isolates of tobacco ejtch virus. 

Dougherty, W. G. and Carrington, J. C. (1988) "Expression and 
function of potyviral gene products", Ann. Rev. Phytopathol. 
26:123-143, describe potyviruses and some of the similarities the 
35 members of the group have with each another. 

Eggenberger, A. L. et al. (1989) "The nucleotide sequence of a 
Soybean Mosaic Virus Coat Protein region and its expression in 
Escherichia coli, Agrobacterium tumefaciens, and tobacco callus", 
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Virology, In press, disclose the nucleotide sequence of the coat 
protein gene for soy bean mosaic virus . 

Hinchee, M. A. W. et al (1988) "Production of transgenic soybean 
plants using Agrobacterium-mediated DNA transfer", Bio/tech. 
5 6:915-921, disclose the production of transgenic soybean plants which 
were transformed with A. tumerfacien plasmids that conferred either 
Kanamycin resistance/^ -glucuronidase activity or Kanamycin 
resistance/glyphosphate tolerance . 

Kozak, M. (1986) "Point mutations define a sequence flanking the 
10 AUG initiator codon that modulates translation by eukaryotic ribo- 
somes", Cell 44:283-292, discloses the optimal sequence around the 
ATG initiator codon of the prepro insulin gene for initiation by 
eukaryotic ribosomes. 

Loesch-Fries et al. (1987) "Expression of alfalfa mosaic virus 
15 RNA 4 in transgenic plants confers virus resistance" , EMBO J 
6:1845-1851, disclose that expression of the coat protein gene of 
alfalfa mosaic virus in transgenic plants confers resistance to 
infection by the virus. 

Pietrzak et al. (1986) "Expression in plants of two bacterial 
20 antibiotic resistant genes after protoplast transformation with a new 
plant expression vector", Nucleic Acids Research 14:5857-5868, dis- 
close expression in plants of foreign genes introduced Into the plant 
using an expression vector containing a movable expression cassette 
consisting of the Cauliflower mosaic virus 35S promoter and 
25 transcription terminator seperated by a polylinker containing several 
unique restriction sites. 

Powell-Abel et al. (1986) "Delay of disease development in 
transgenic plants that express the tobacco mosaic virus coat protein 
gene", Science 232:738-743, disclose increased resistance toward 
30 infection by tobacco mosaic virus in transgenic plants containing the 
coat protein gene from tobacco mosaic virus. 

Quemada, H. D. et al. (1989) "The nucleotide sequences of cDNA 
clones from RNA3 of Cucumber Mosaic Virus strains C and WL" , J . Gen. 
Virol. 70:1065-1073, reports the nucleotide sequences of cDNA clones 
35 from RNA3 of Cucumber Mosaic Virus strains C and WL and compares them 
to each other and other strains for homology. 

Shukla, D. D. et al. (1986) "Coat Proteins of Potyviruses" , 
Virology 152:118-125, discloses the amino acid sequence of the potato 
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virus Y coat protein. 

Shukla, D. D. et al. (1988) "The N and C termini of the Coat 
Proteins of Potyviruses Are Surface -located and the N Terminus 
Contains the Major Virus -specific Epitopes", J. Gen. Virol. 
5 69:1497-1508, disclose that the N- and C- termini regions of some 
potyvirus coat proteins are located at the surface of the viral 
particles. The viral particles were treated with trypsin and it was 
observed that the enzyme treatment removed 30-67 amino acids from the 
N- terminal and 18-20 amino acids from the C- terminal; the variations 

10 were dependent on the virus. The remaining portion of the coat 
protein, the core, was highly conserved among the various viruses. 

Turner et al. (1987) "Expression of alfalfa mosaic virus coat 
protein gene confers cross -protection in transgenic tobacco and 
tomato plants", EMBO J. 6:1181-1188, disclose transgenic tobacco and 

15 tomato plants transformed with the coat protein gene of alfalfa 
mosaic virus display increased resistance to infection by alfalfa 
mosaic virus. 

Yeh and Gonsalves (1985) "Translation of Papaya Ringspot Virus 
RNA in vitro: Detection of a Possible Polyprotein That is Processed 
20 for Caps id Protein, Cylindrical -Indus ion Protein, and Amor- 
phous-Inclusion Protein", Virology 143:260-271, describe the 
possibility that the RNA genome encodes a single proprotein which 
undergoes post- translational processing to form the potyvirus protein 
products . 

25 The following scientific publications are of interest but not 

relevant : 

An et al. (1985) "New cloning vehicles for transformation of 
higher plants", EMBO J. 4:277-285 describe the construction of an 
expression plasmid which may be stably replicated in both E. coli and 

30 A. tumerf aciens . 

An, G. (1986) "Development of plant promoter expression vectors 
and their use for analysis of differential activity of nopaline 
synthase promoter in transformed tobacco cells", Plant Physiol. 
81:86-91, reports differences in promoter activities of transferred 

35 genes within the same cells as well as in independently derived cell 
lines . 

Bevan et al . (1983) "Structure and transcription of the nopaline 
synthase gene region of T-DNA" , Nucleic Acids Research 11:369-385, 
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disclose the DNA sequence and plant- tumor transcription pattern of a 
portion of DNA from A. tumerfaciens strain T37. 

Depicker et al. (1982) "Nopaline synthase: transcript mapping 
and DNA sequence", J. Mol. Appl. Genet. 1:561-573, disclose the DNA 
5 sequences 5' and 3' to the nos gene found in A. tumerfaciens. 

Hepburn, A. et al. (1985) "The use of pNJSOOO as an intermediate 
vector for genetic manipulation of Agrobacterium Ti-plasmids" , J. 
General Microbio. 131:2961-2969, describe vectors which are used to 
transfer narrow host range vectors from E. coli to A. tumerfaciens. 
10 Klein et al. , (1987) "High-velocity microproj ec tiles for 

delivering mucleic acids into living cells", Nature 327:70-73, 
disclose that nucleic acids may be delivered into living cells using 
accelerated, small tungsten balls which pierce the cells without 
killing them. 

15 Klein et al. , (1988) "Factors influencing gene delivery into Zea 

mays cells by high-velocity microproj ectiles" , Bio/tech. 6:559-563, 
disclose that two days after bombarding plant cells with a plasmid 
coated microproj ectile, expression of an gene encoding an enzyme 
could be detected. 

20 Mazur, B. J. and Chui, C.-F. (1985) "Sequence of a genomic DNA 

clone for the small subunit of ribulose bis -phosphate 
carboxylase-oxygenase from tobacco" , Nucleic Acids Research 
13:2373-2386, disclose the DNA sequence of the small subunit of 
ribulose bis -phosphate carboxylase -oxygenase from tobacco. 

25 McCabe, D. E. , et al., (1988) "Stable transformation of soybean 

(Glycine max) by particle acceleration", Bio/tech. 6:923-926, 
disclose expression in soybean shoots of foreign genes introduced 
into immature soybean seeds using DNA coated microproj ectiles . 

Olson, M. K. et al (1989) "Enhancement of heterologous 

30 polypeptide expression by alterations in the ribosome-binding-site 
sequence", J. Biotech. 9:179-190, discloses the increase in gene 
expression of heterologous genes in E. coli due to the presence of an 
AT -rich 5' untranslated region. 

Slightom et al. (1983) "Complete nucleotide sequence of a French 

35 bean storage protein gene: Phaseolin" , Proc. Natl. Acad. Sci. U.S.A. 
80:1897-1901, disclose the complete nucleotide sequences of the gene 
and the mRNA coding for a specific phaselin type French bean major 
storage protein. 
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Vilaine, F. and Casse-Delbart , F. (1987) "Independent induction 
of transformed roots by the TL and TR regions of the Ri plasmid of 
agropine type Agrobacterium rhizogenes", Mol. Gen. Genet. 206:17-23, 
disclose the respective role of Tl- and TR-DNA in root induction by 
5 agropine type Agrobacterium rhizogenes Ri plasmids . 

None of these documents, either alone or taken together, teaches 
or suggests the instant invention which relates to potyvirus coat 
protein genes and plants transformed therewith. 

SUMMARY OF THE INVENTION 
10 The present invention relates to the coat protein genes of 

Papaya Ringspot Virus Strain papaya ringspot (PRV-p) , Watermelon 
Mosaic Virus II (WMVII) , and Zucchini Yellow Mosaic Virus (ZYMV) . 

The present invention relates to a recombinant DNA molecule 
which encodes a potyvirus coat protein. The present invention 
15 relates to a recombinant DNA molecule comprising a potyvirus coat 
protein gene operably linked to genetic regulatory sequences 
necessary for gene expression. 

The present invention relates to expression vectors which 
contain a coat protein gene for potyviruses, and, additionally, the 
20 necessary genetic regulatory sequences needed for expression of a 
gene transferred into a plant. The present invention also relates to 
bacterial or plant cells which are transformed with an expression 
vector containing the coat protein genes. Furthermore, the present 
invention relates to transgenic plants which are. produced from plant 
25 cells transformed with an expression vector containing the coat 
protein gene from potyviruses. In addition, the present invention 
relates to a process of producing transgenic plants which have 
increased resistance to viral infection. 

DETAILED DESCRIPTION OF THE INVENTION 
30 Charts 1, 2 and 3 contain DNA nucleotide sequences of the coat 

protein genes of PRV-p, WMVII and ZYMV, respectively. Charts 4 and 5 
compare the nucleotide sequences of various coat protein genes. 
Charts 6-14 are set forth to illustrate the constructions of this 
invention. Certain conventions are used to illustrate plasmids and 
35 DNA fragments as follows: 

(1) The single line figures represent both circular and linear 
double -stranded DNA. 

(2) Asterisks (*) indicate that the molecule represented is 
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circular. Lack of an asterisk indicates the molecule is 
linear . 

(3) Junctions between natural boundaries of functional com- 
ponents are indicated by vertical lines along the 

5 horizontal lines. 

(4) Genes or functional components are indicated below the 
horizontal lines. 

(5) Restriction sites are indicated above the horizontal lines. 

(6) Distances between genes and restriction sites are not to 
10 scale. The figures show the relative positions only unless 

indicated otherwise. 

(7) The following abbreviations are used to denote function and 
components : 

a) P ca - CaMV35S promoter; 
15 b) I c CMV intergenic region, the intergenic region 

comprising the initiation codon and AT rich 5' 
untranslated region; 

c) S ca - CaMV35S poly (A) addition signal; and 

d) Nos = Nos nptll gene. 

20 Most of the recombinant DNA methods employed in practicing the 

present invention are standard procedures, well known to those 
skilled in the art, and described in detail in, for example, European 
Patent Application Publication Number 223452, published November 29, 
1986, which is incorporated herein by reference. Enzymes are 
25 obtained from commercial sources and are used according to the 
vendor's recommendations or other variations known in the art. 
General references containing such standard techniques include the 
following: R. Wu, ed. (1979) Methods in Enzymology, Vol. 68; J. H. 
Miller (1972) Experiments in Molecular Genetics; T. Maniatis et al. 
30 (1982) Molecular Cloning: A Laboratory Manual; D. M. Glover, ed. 
(1985) DNA Cloning Vol. II; H.G. Polites ; and K.R. Marotti (1987) "A 
step-wise protocol for cDNA synthesis".; Biotechniques 4:514-520; 
S.B. Gelvin and R.A. Schilperoort , eds . Introduction, Expression, 
and Analysis of Gene Products in Plants, all of which are 
35 incorporated by reference. 

For the purposes of the present disclosure the following 
definitions apply. 

"Promoter" means a promoter which is functional in the host 
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plant. 

"Initiation region" includes the initiation codon and 
nucleotides flanking the initiation codon. 

"Operably linked" refers to the linking of nucleotide regions 
5 encoding specific genetic information such that the nucleotide 
regions are contiguous, the functionality of the region is preserved 
and will perform its function relative the the other regions as part 
of a functional unit. 

"AT rich 5' untranslated region" is a nucleotide sequence 
10 composed of at least 60% adenine or thymine nucleotides. 

"Untranslated flanking region" refers to nucleotide sequences 
which are 3' of the termination codon and end at the poly (A) addition 
signal. These sequences enhance production of the peptide encoded by 
the upstream gene. 

15 "Vector" is a vehicle by means of which DNA fragments can be 

introduced into host organisms . 

"Expression vector" is a vehicle by means of which DNA fragments 
that contain sufficient genetic information and can, therefore, be 
expressed by the host, can be introduced into host organisms. 

20 "Antipathogen gene" is a gene which encodes a DNA sequence which 

is either the antisense sequence of a pathogenic gene or the 
antipathogenic gene encodes a peptide whose presence in an organism 
confers an increased resistence to a pathogen. 

To .practice the present invention, the coat protein gene of a 

25 virus must be isolated from the viral genome and inserted into a 
vector containing the genetic regulatory sequences necessary to 
express the inserted gene. Accordingly a vector must be constructed 
to provide the regulatory sequences such that they will be functional 
upon inserting a desired gene. When the expression vector/insert 

30 construct is assembled, it is used to transform plant cells which are 
then used to regenerate plants. These transgenic plants carry the 
viral gene in the expression vector/insert construct. The gene is 
expressed in the plant and increased resistance to viral infection is 
conferred thereby. 

35 Several different courses exist to isolate the coat protein 

gene. To do so, one having ordinary skill in the art can use 
information about the genome organization of potyviruses to locate 
and isolate the coat protein gene. The coat protein gene is located 
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at the 3' end of the RNA, just prior to a stretch of about 200-300 
adenine nucleotide residues. Additionally, the information related 
to proteolytic cleavage sites is used to determine the N- terminus of 
the potyvirus coat protein gene. The protease recognition sites are 
5 conserved in the potyviruses and have been determined to be either 
the dipeptide Gln-Ser, Gln-Gly or Gin-Ala. The nucleotide sequences 
which encode these dipep tides can be determined. 

Using methods well known in the art, a quantity of virus is 
grown up and harvested. The viral RNA is then seperated and the coat 
10 protein gene can be isolated using a number of known procedures. A 
cDNA library is created using the viral RNA. The methods followed to 
do this are well known in the art. The viral RNA is treated with 
reverse transcriptase and a complementary DNA molecule is produced. 
A DNA complement of the complementary DNA molecule is produced and 
15 that sequence represents a DNA copy of the original viral RNA 
molecule. Thus, a double stranded DNA molecule is generated which 
contains the sequence information of the viral RNA. These DNA 
molecules can be ' cloned in E. coli plasmid vectors after the 
additions of restriction enzyme linker molecules by DNA ligase. The 
20 various fragments are inserted into cloning vectors which are then 
used to transform E. coli and create a cDNA library. 

Since the coat protein gene is located just 5' to the polyA 
region, oligonucleotides that can hybridize to the polyA region can 
be used as hybridization probes to screen the cDNA library and 
25 determine if any of the transformed bacteria contain DNA fragments 
with sequences coding for the polyA region. The cDNA inserts in any 
bacterial colonies which contain this region can be sequenced. The 
coat protein gene is present in its entirety in colonies which have 
sequences that extend 5' to the sequence which encodes the proteo- 
30 lytic cleavage site described above. 

Alternatively, cDNA fragments may be inserted into expression 
vectors. Antibodies against the coat protein may be used to screen 
the cDNA expression library and the gene can be isolated from 
colonies which express the protein. 
35 Using the sequences disclosed in Charts 1, 2 and 3, the coat 

protein genes for the respective viruses may be synthesized 
chemically by methods well known in the art. Alternatively, the 
information in Charts 1, 2 and 3 may be" used to synthesize 
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oligonucleotides which can be used as probes to .screen a cDNA 
library. 

The nucleotide sequences of the coat protein genes for WMV-II, 
PRV-p and ZYMV have been determined and the genes have been inserted 
into expression vectors. The expression vectors contain the 
necessary genetic regulatory sequences for expression of an inserted 
gene. The coat protein gene is inserted such that those regulatory 
sequences are functional so that the genes can be expressed when 
incorporated into a plant genome. 

In order to express the viral gene, the necessary genetic 
regulatory sequences must be provided. Since the coat protein. of a 
potyvirus is produced by the post translational processing of a 
polyprotein, the coat protein gene isolated from viral RNA does not 
contain the genetic regulatory sequences needed for gene expression. 
15 The coat protein gene does not contain the transcription and 
translation signals necessary for its expression once transferred and 
integrated into a plant genome. It must, therefore, be engineered to 
contain a plant expressible promoter, a translation initiation codon 
(ATG) and a plant functional poly (A) addition signal (AATAAA) 3' of 
20 its translation termination codon. In the present invention, the 
coat protein is inserted into a vector which contains a cloning site 
for insertion 3' of the initiation codon and 5' of the poly (A) 
signal. The promoter is 5' of the initiation codon such that when a 
structural gene is inserted at the cloning site, a functional unit is 
25 formed in which the inserted gene is expressed under the control of 
the various genetic regulatory sequences. 

In the preferred embodiment of the present invention, additional 
genetic regulatory sequences are provided. As described above, an 
expression vector must contain a promoter, an intiation codon and a 
30 poly (A) addition signal. In order to get a higher level of 
expression, untranslated regions 5' and 3' to the inserted genes are 
provided. Furthermore, certain sequences flanking the initiation 
codon optimize expression. The promoter used is one that is chosen 
for high level expression. 
35 A 5' untranslated region which results in higher level expres- 

sion of an inserted gene is provided downstream from the promoter and 
upstream from the initiation codon. This region contains at least 
60% of the sequence a Adenine and Thymine. There is a statistical 
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bias for expression when such an AT rich region is located between 
the promoter and intiation codon. This preference is utilized in the 
preferred embodiment of the present invention by inclusion of an AT 
rich 5' untranslated region intermediate of the promoter and initi- 
5 at ion codon. 

The preferred embodiment of the present invention also contains 
specific nucleotide sequence flanking the initiation codon. This 
preferred sequence, termed Kozak's element, is AAXXATGG wherein X 
represents any of the four nucleotides. The presence of the ? 
10 initiation codon following Kozak's rule results in higher level 
expression when used in an expression vector. In the preferred 
embodiment of the present invention, the small subunit from the SS 
RUBISCO contains an initiation codon in which Kozak's element is 
used. 

15 Furthermore, the preferred embodiment of the present invention 

contains a 3' untranslated region downstream from the cloning site 
where the coat protein gene is inserted and upstream from the poly (A) 
addition signal. The sequence of this 3' untranslated region results 
in a statistical bias for protein production. The sequence promotes 

20 high level expression. The poly(A) addition signal is found directly 
downstream from the 3' untranslated region and can be derived from 
the same source. In the preferred embodiment of the present 
invention, the 3' untranslated region and poly (A) addition signal are 
derived from CaMV 35S gene or the phaseolin seed storage protein 

25 gene. 

The poly (A) addition signal from CaMV, nopaline synthase, 
octopine synthase, bean storage protein, and SS RUBISCO genes are 
also suitable for this construction. Several promoters which 
function in plants are available, but the best promoters are the 

30 constitutive promoter from cauliflower mosaic virus (CaMV, a plant ? 
DNA virus) and the small subunit of ribulose bis -phosphate 
carboxylase- oxygenase (SS RUBISCO) gene. 

Using methods well known to those skilled In the art, plant 
cells are transformed with the vector construct and the plant cells 

35 are induced to regenerate. The resulting plants contain the coat 
protein genes and produce the coat protein. The production of the 
protein confers upon the plant an increased resistance to infection 
by the virus from which the coat protein gene was derived. 
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DESCRIPTION OF THE PREFERRED EMBODIMENTS 
Example 1 Isolation of WMVII RNA 

Watermelon mosaic virus II (WMV II) was propagated in zucchini 
squash (Cucurbita pepo L) plants and RNA was isolated by the method 
5 described by Yeh and Gonsalves (Virology 143:260, 1985). 
Example 2 Isolation of PRV-p RNA 

Papaya ringspot virus strain prv (PRV-p) was propagated in jelly 
melon, Cucumis metuliferus (Nand.) Mey. Acc. 2549 plants and RNA was 
isolated by the method described by Yeh and Gonsalves (Virology 

10 143:260, 1985). 

Example 3 Isolation of ZYMV RNA 

Zucchini yellow mosaic virus (ZYMV) was propagated in zucchini 
squash (Cucurbita pepo L) plants and RNA was isolated by the method 
described by Yeh and Gonsalves (Virology 143:260, 1985). 
15 Example 4 Synthesis of double -stranded cDNA 

The procedure used to make double stranded cDNA from isolated 
viral RNA is the same for all viral RNA isolated above. The purified 
RNA was subjected to the cDNA synthesis protocol described by Polites 
and Marotti (Biotechniques 4:514, 1986) and because this RNA contains 
20 an A-rich region at its 3' -end (similar to that found for many 
eukaryotic mRNAs) the procedure was straight- forward. The synthesis 
of double stranded cDNA was also done as described by Polites and 
Marotti. After double -stranded cDNA was synthesized, it was purified 
by passage through a G-100 Sephadex column, precipitated with 
25 ethanol, and suspended in 20 ftl of 10X EcoRI methylase buffer (100 mM 
NaCl, 100 mM Tris-HCl, pH 8.0, 1 mM EDTA, 80 fM S-adenosyl 
methionine, and 100 ftg/ml bovine serum albumin). An additional 
amount of S-adenosyl methionine (1 y.1 of a 32 mM solution) was added 
to the reaction mixture, followed by the addition of 1 pi (20 units) 
30 EcoRI methylase. The reaction was incubated at 37°C for 30 minutes 
and stopped by incubation at 70° C for 10 minutes. Then 1 M l (5 
units) of E. coli DNA polymerase I Klenow fragment was added and 
incubated at 37 °C for 10 minutes, followed by phenol/chloroform 
extraction and ethanol precipitation. The pellet was washed in 70% 
35 ethanol, then 70% ethanol/0.3 M sodium acetate. The pellet was dried 
and resuspended in 8 y.1 of 0- 5 phosphorylated EcoRI linkers 

(Collaborative Research, Inc., 128 Spring St., Lexington, MA 02173). 
One a*1 of 10X ligase buffer (800 mM Tris-HCl ph 8.0, 200 mM MgCl 2 . 
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150 mM DTT, 10 mM ATP) and 1 fil of T4 DNA ligase (4 units) were 
added, and the reaction was incubated overnight at 15°C. The 
ligation reaction was stopped by incubation at 65°C for 10 minutes. 
Sixty fil of H 2 0, 10 fil of 10X EcoRI salts (900 mM Tris-HCl pH 8.0, 
5 100 mM MgCl 2 , 100 mM NaCl) , and 10 fil of EcoRI (10 units//xl) were 
added, and the reaction was incubated at 37 °G for 1 hour. The 
reaction was stopped by phenol/chloroform and chloroform extractions. 
The reaction mixture was then size fractionated by passage through a 
Sephadex G-100 column and the fractions containing the largest double 
10 stranded cDNA molecules were concentrated by butanol extractions, 
precipitated with ethanol, and resuspended in 10 fil of H 2 0. Five fil 
of the double stranded cDNAs was added to 0.5 fig of pUC19 DNA (which 
had been previously treated with phosphatase to remove the 5' 
phosphates), 1 fil of 10X ligase buffer, and 1 fil of T4 ligase, and 
15 the reaction was incubated at 15° C for 16 hours. The resulting 
ligated pUC19-coat protein gene double stranded cDNA molecules were 
transformed into E.coli host cells as described by the manufacturer 
(Bethesda Research Laboratories, Inc., Gaithersburg, MD 20877) and 
plated on medium containing 50 /ig/ml ampicillin, 0.04 mM IPTG, and 
20 0.004% X-Gal. Bacterial colonies showing no blue color were selected 
for further analysis. Clones containing the 3' -region and possibly 

■JO 

the coat protein gene were identified by hybridization against a ->*P- 
labeled oligo-dT. Bacterial colonies showing hybridization to this 
probe should contain at least the poly(A) region of the particular 
25 potyvirus genome. Several of the hybridizing bacterial clones were 
selected and plasmid DNAs were isolated according to methods known to 
those skilled in the art. 

Example 5 Identification of the PRV-p Coat Protein Gene 

Several of the cloned cDNAs of PVP-p RNA were sequenced by the 

30 chemical DNA sequencing method described by Maxam and Gilbert 
(Methods of Enzymology 65:499, 1980). Based on this information and 
comparative analysis with other potyviruses clone number pPRV-117 was 
found to contain a complete copy of the PRV-p coat protein gene. The 
N- terminus of the coat protein was identified by the location of the 

35 dipeptide sequence Gln-Ser. The length of the PRV-p coat protein 
gene coding region is consistent with a gene encoding a protein of 
about 33 kDal. The sequence of the PRV-p coat protein gene and 
protein are shown in Chart 1. In addition, comparison of this 
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sequence with that of the related virus PRV-w described by Nagel and 
Heibert (Virology 143:435, 1985) shows that the two coat protein 
genes share 98% nucleotide and amino acid similarities (Chart 4). 
Because these two viruses share extensive identities in their coat 
5 proteins, expression of the coat protein gene from PRV-p is expected 
to yield plants resistant to both PRV-p and PRV-w. 

Example 6 . Construction of a Plant-expressible PRV-p Coat Protein 
Gene Cassette with CaMV 35S Promoter and Polyadenylation Signal and 
CMV 5' Untranslated Region and Translation Initiator ATG. 

10 Attachment of the necessary plant regulatory signals to the PRV- 

p coat protein gene was accomplished by constructing a translational 
fusion with a clone originally designed for the expression of the CMV 
coat protein gene, using clone P UC1813/CP19 . Plasmid P UC1813/CP19 is 
a vector suitable for agrobacterium mediated gene transfer. An 

15 EcoRI-EcoRI fragment was removed from pDH51/CP19 and placed into the 
EcoRI site of the plasmid, pUC1813 (available from Robert K. , 
Department of Chemistry, Washington State University, Pullman, 
Washington), creating plasmid pUC1813/CP19 . Plasmid P UC1813/CP19 was 
described in U.S. patent application Serial Number 07/135,591 filed 

20 on December 21, 1987 incorporated herein by reference. This trans- 
lational fusion clone was constructed by first identifying two 
restriction enzyme sites within clone pUC1813/CP19 . One site (Tthlll 
I) is located between amino acids 13 to 17 while the other site (BstX 
I) is located near the end of the 3' -untranslated region of the CMV 

25 coat protein gene. 

Addition of these specific restriction enzyme sites to the PRV-p 
coat protein gene was accomplished by the polymerase chain reaction 
technique, using an instrument and Taq polymerase purchased from 
Perkin Elmer-Cetus, Emeryville, Ca. Specifically, two respective 5' 

30 and 3' oligomers (CGACGTCGTCAGTCCAAGAATGAAGCTGTG , containing a Tthlll 
I site and ( CCCACGAAAGTGGGGTGAAACAGGGTCGAGTCAG , containing a BstX I 
site), sharing at least 20 nucleotides with the PRV-p coat protein 
gene were used to prime synthesis and gene amplification of the coat 
protein gene. After synthesis, the amplified fragments were digested 

35 with Tthlll I and BstX I to expose the sites. 

As shown in Chart 6, P UC1813/CP19 is the expression vector which 
contains the CMV coat protein gene. Plasmid P UC1813/CP19 contains 
Tthlll I and BstX I sites. 
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The digested, amplified fragments are ligated into the respec- 
tive exposed sites of pUC1813/CP19 and the expected new construction 
was identified using methods known to those skilled in the art. 
Polymerase chain reaction techniques were used to amplify PRV-P coat 
5 protein gene containing the Tthllll and BstXI sites. The plasmid 
pUC1813/CP19 and PRV-P coat protein gene fragments were digested with 
Tthllll and BstXI and ligated to each other. The resulting clone, 
designated pUC1813/CP19-PRVexp, was subjected to nucleotide 
sequencing to ensure that the cloning and gene amplification did not 
10 introduce any detrimental artifacts. The sequence showed no arti- . 
facts, suggesting that this plant expression cassette should be 
capable of expressing a PRV-p coat protein gene which contains an 
additional 16 amino acids of CMV coat protein at its N- terminus. 
Example 7 Construction of a Micro T-DNA Plasmid Containing the 

15 Plant-expressible PRV-p Coat Protein Gene Construction. 

As depicted in Chart 7, the plant expression cassette for the 
PRV-p coat protein gene was transferred into a suitable micro T-DNA 
vector which contains the necessary Agrobacterium T-DNA transfer 
signals for transfer from an Agrobacterium and integration into a 
20 plant genome, and a wide host-range origin of replication (for 
replication in Agrobacterium). Plasmid pUCl813/CP19-PRVexp was 
digested with Hind III and the resulting 2.2 kb insert fragment 
containing the plant- expressible cassette was removed and ligated 
into the Hind III site of the modified Agrobacterium- derived micro- 
25 vector pGA482 (modification included the addition of the f}- 
glucuronidase gene). The micro T-DNA vector, pGA482, is available 
from G. An, Institute of Biological Chemistry, Washington State 
University, Pullman, WA. The resulting plasmid was designated, 
pGA482/G/CP19-PRVexp and is shown in Chart 7. This plasmid (or 
30 derivatives thereof) was transferred into virulent or avirulent 
strains of Agrobacterium tumef aciens or rhizogenes , such as A208 , 
C58, LBA4404, C58Z707, A4RS, A4RS(pRiB278b) , and others. Strains 
A208 C58, LBA4404, and A4RS are available from American Type Culture 
Collection (ATCC) , 12301 Parklawn Drive, Rockville, MD. Bacteria 
35 A4RS(pRiB278b)is available from Dr. F. Casse-Delbart, C.N.R.A. , 
Routede Saint Cyr. F78000, Versailles, France. Strain C58Z707 is 
available from Dr. A.G.Hepburn, Dept. of Agronomy, University of 
Illinois, Urbana, IL. 
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After transfer of the engineered plasmid pGA482/G/CP19 -PRVexp 
into any of the above listed Agrobacterium strains, these Agro- 
bacterium strains can be used to transfer and integrate within a 
plant genome the plant-expressible PRV-p coat protein gene contained 
5 within its T-DNA region. This transfer can be accomplished using the 
standard methods for T-DNA transfers which are known to those skilled 
in the art, or this transfer can be accomplished using the methods 
described in a U.S. patent application Serial Number 07/135,655, 
filed on December 21, 1987, entitled "Agrobacterium Mediated 
10 Transformation of Germinating Plant Seeds" and incorporated herein by 
reference . 

Example 8 Construction of a Plant-expression Cassette for 

Expression of Various Genes in Transgenic Plants. 

In the preferred embodiment of the present invention, the 

15 following expression cassette was constructed to provide the 
necessary plant regulatory signals (which include the addition of a 
promoter, 5' untranslated region, translation initiation codon, 
and polyadenylation signal) to the gene inserts in order to achieve 
high level expression of the inserts. The expression cassette may be 

20 used to express any genes inserted therein. Accordingly, the 
applicability- of the expression cassette goes beyond its use in 
expressing coat protein genes. Rather, the expression cassette may 
be used to express any desired protein in transgenic plants. The 
expression cassette is the preferred expression system for expressing 

25 viral coat protein genes in plants. 

The expression cassette of the preferred embodiment contains: a 
constitutive promoter; a 5' untranslated region which enhances gene 
expression; an initiation codon which comprise Kozak's element; a 
cloning site where the gene to be expressed may be inserted to 

30 produce a functional expression unit; and a 3' untranslated region 
which comprises a poly(A) addition signal and untranslated flanking 
regions which result in a higher level of expression. 

More specifically, the expression cassette which is the 
preferred embodiment of the present invention consists of the 

35 cauliflower mosaic virus (CaMV) 35S transcript promoter, the 5'- 
untranslated region of cucumber mosaic virus (CMV) , the CMV 
translation initiation codon, and the CaMV polyadenylation signal. 
The construction of this expression cassette utilized the Polymerase 
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Chain Reaction (PGR) technique to obtain correct position of the 
plant regulatory signals and the addition of convenient restriction 
enzyme sites which allow for the easy addition of a coat protein gene 
and the excision of the completed cassette so it can be transferred 
5 to other plasmids . 

To accomplish the construction of this expression cassette the 
following oligomers were synthesized: 

1. 5' -GAAGCTTCCGGAAACCTCCTCGGATTCC-3 ' , contains a Hindlll site 
at its 5' -end and contains 21 bases which are identical to 21 bases 

' 10 in the 5 '-flanking region of CaMV. 

2. 5 ' -GCCATGGTTGACTCGACTCAATTCTACGAC-3' , contains a Ncol site 
at its 5' -end which contains a translation initiation codon which 
conforms to Kozak's rules and has 21 bases which are identical to 21 
bases in the antisense strand of the CMV 5 ' -untranslated region. 

15 3. 5 ' -GCCATGGTTGCGCTGAAATCACCAGTCTC-3 ' , contains a Ncol site at 

its 5 '-end (which contains the same translation initiation codon as 
oligomer 2) and has 20 bases which are identical to 20 bases in the 
3 ' -untranslated region of CaMV. 

4. 5 ' -GAAGCTTGGTACCACTGGATTTTGGTT-3 ' , contains a Hindlll site 
20 at its 3' -end and has a 20 base match with the flanking DNA region 3' 
of the CaMV polyadenylation site (on the antisense strand) . 

These oligomers were used to amplify sequences contained within 
the CMV expression clone referred to as pUC1813/CP19 , shown in Chart 
6, and referred to above. As depicted in Chart 8, the PCR technique 
25 was used to amplify the gene regulatory regions in pUC1813/CP19 . 
Amplification of the 5' -flanking, CMV 5 ' -untranslated region, and CMV 
initiation codon (which was modified to conform to Kozak's rule 
AAXXATGG) resulted in a fragment of about 400 base pairs in length 
and amplification of the CaMV 3 -untranslated and flanking regions 
30 resulted in a fragment of about 200 base pairs in length. These 
fragments were digested with Ncol and Hindlll, isolated from a 
polyacrylamide gel, and then ligated into Hindlll digested and 
phosphatase treated pUC18. The resulting clone is referred to as 
pl8CaMV/CMV-exp and is shown in Chart 8. 
35 Example 9 Identification of the WMVII Coat Protein Gene 

The cloned WMVII cDNA insert from clone pWMVII-41-3 . 2 which was 
produced as described above, was sequenced by using both the chemical 
(Maxam and Gilbert, Methods of Enzymology 65:499, 1980) and enzymatic 
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10 



(Sanger et al. , Proc. Natl. Acad. Sci. U.S.A. 74:5463, 1977) methods. 
Based on this information and comparative analysis with other 
potyviruses, the nucleotide sequence of clone pWMVII-41-3 . 2 was found 
to contain a complete copy of the WMVII coat protein gene. The N- 
terminus of the coat protein was suggested by the location of the 
dipeptide sequence Gln-Ser. The length of the WMVII coat protein 
gene coding region (281 amino acids) is consistent with a gene 
encoding a protein of about 33 kD. The sequences of this WMVII coat 
protein gene and protein are shown in Chart 2. In addition, 
comparison of this sequence with that obtained from the related virus 
Soybean Mosaic Virus (SMV) strain N described by Eggenberger et al. 
shows that they share overall about 88% identity and excluding the N- 
terminal length differences they share about 92.5% identity, see 
Chart 5. Because these two virus coat proteins share extensive amino 
15 acid identities, expression of the coat protein gene from WMVII is 
expected to yield plants resistant to WMVII infection and could yield 
plants resistant to SMV infection. 

Example 10 Construction of a Plant-expressible WMVII Coat Protein 
Gene Cassette with CaMV 35S Promoter and Polyadenylation Signal and 

20 CMV Intergenic Region and Translation Initiator ATG. 

As depicted in Chart 9, attachment of the necessary plant 
regulatory signals to the WMVII coat protein gene was accomplished by 
using the PCR technique to amplify the WMVII coat protein gene using 
oligomers which would add the necessary s.ites to its 5' and 3' 

25 sequences. Following this amplification the resulting fragment is 
digested with the appropriate restriction enzyme and cloned into the 
Ncol site of the above described expression cassette containing 
plasmid, pl8CaMV/CMV-exp . Clones containing the WMVII coat protein 
gene insert need only be checked to determine correct orientation 

30 with respect with the CaMV promoter. However, to ensure that no 
artifacts have been incorporated during the PCR amplification the 
entire coat protein gene region is checked by nucleotide sequence 
analysis . 

To obtain the amplified WMVII coat protein gene with Ncol 
35 restriction enzyme sites on both ends the following two oligomers 
were synthesized: 

1. 5 ' -ACCATGGTGTCTTTACAATCAGGAAAAG-3 ' , which adds a Ncol site 
to the 5' -end of the WMVII coat protein gene and retains the same ATG 
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translatlon start codon which is present in the expression cassette, 
pl8CaMV/CMV-exp. 

2. 5' -ACCATGGCGACCCGAAATGCTAACTGTG-3 ' , which adds a Ncol site 
to the 3' -end of the WMVII coat protein gene, this site can be 
5 ligated into the expression cassette, pl8CaMV/CMV-exp . 

The cloning of this PCR WMVII coat protein gene, using these two 
oligomers, into pl8CaMV/CMV-exp yields a plant expressible WMVII gene 
(referred to as pl8WMVII-exp) which, following transcription and 
translation, will generate a WMVII coat protein which is identical to 
10 that derived from the WMVII coat protein gene nucleotide sequence, 
see Chart 2. However, this coat protein will differ, because of 
necessary genetic engineering to add the ATG initiation codon and by 
including the last four amino acids of the 54 kD nuclear inclusion 
protein (which is adjacent to the Glu-Ser protease cleavage site); 
15 the amino acids added are Val-Ser-Leu-Glu-N-ter WMVII. The addition 
of these four amino acid residues should not affect the ability of 
this coat protein to yield plants which are resistant to WMVII 
infections, because the N- terminal region of potyvirus coat proteins 
appear not to be well conserved for either length or amino acid 
20 identity. However, if this is found to be a problem its replacement 
would involve the use of a different oligomer to obtain N- terminal 
variations of the WMVII coat protein gene. The cloned construction 
of the plant expressible WMVII coat protein gene is referred to as 
pl8WMVII-exp, and is shown in Chart 9. 
25 Example 11 Construction of a Micro T-DNA Plasmid Containing the 
Plant-expressible WMVII Coat Protein Gene Construction. 

As depicted in Chart 10, the plant expression cassette for the 
WMVII coat protein gene was transferred into a suitable micro-T-DNA 
vector which contains the necessary Agrobacterium T-DNA transfer 
30 signals (to mediated transfer from an Agrobacterium and integration 
Into a plant genome) and wide-host range origin of replication (for 
replication in Agrobacterium) to form plasmid pGA482/G/CPWMVII-exp . 
To construct this plasmid, plasmid pl8WMVII-exp was digested with 
Hind III (which cuts within the polycloning sites of pUC18 , well 
35 outside of the expression cassette), and an 1.8 kb fragment 
containing the plant-expressible cassette was removed and ligated 
into the Hind III site of the modified Agrobacterium- derived micro - 
vector pGA482 (modification included the addition of the jS- 



WO 90/02184 



PCT/US89/03094 



-21- 

glucuronidase gene). The micro T-DNA vector, pGA482, is shown in 
Chart 7 and available from G. An, Institute of Biological Chemistry, 
Washington State University, Pullman, WA. The resulting plasmid was 
designated, pGA482/G/CPWMVII-exp is shown in Chart 10. This plasmid 
5 (or derivatives thereof) was transferred into virulent or avirulent 
strains of Agrobacterium tumefaciens or rhizogenes, such as A208, 
C58, LBA4404, C58Z707, A4RS , A4RS (pRiB278b) , and others. Strains 
A208 C58, LBA4404, and A4RS are available from American Type Culture 
Collection (ATCC) , 12301 Parklawn Drive, Rockville, MD. Bacteria 

10 A4RS(pRiB278b)is available from Dr. F. Casse-Delbart , C.N.R.A., 
Routede Saint Cyr. F78000, Versailles, France. Bacteria C58Z707 is 
available from Dr. A.G.Hepburn, Dept. of Agronomy, University of 
Illinois, Urbana, IL. 

After transfer of the engineered plasmid pGA482/G/CPWMVII-exp 

15 into any of the above listed Agrobacterium strains, these 
Agrobacterium strains can be used to transfer and integrate within a 
plant genome the plant-expressible WMVII coat protein gene contained 
within its T-DNA region. This transfer can be accomplished using the 
standard methods for T-DNA transfers which are known to those skilled 

20 in the art, or this transfer can be accomplished using the methods 
described in U.S. Patent application SN 07/135,655 filed December 21, 
1987 entitled "Agrobacterium Mediated Transformation of Germinating 
Plant Seeds". In addition, It has recently been shown that such 
Agrobacteria are capable of transferring and integrating their T-DNA 

25 regions into the genome of soybean plants . Thus these strains could 
be used to transfer the plant expressible WMVII coat protein gene 
into the genome of soybean to develop a soybean plant line which is 
resistant to infection from soybean mosaic virus strains. 
Example 12 Microprojectile Transfer of pWMVII-exp into Plant 

30 Tissues. 

Recently an alternative approach for the transfer and 
integration of DNA into a plant genome has been developed. This 
technique relies on the use of microprojectiles on which the DNA 
(plasmid form) is attached. These microprojectiles are accelerated 
35 to high velocities and their momentum is used to penetrate plant cell 
walls and membranes. After penetration into a plant cell the 
attached DNA leaches off the microprojectile and is transferred to 
the nucleus where DNA repair enzymes integrate the "free" DNA into 
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the plant genome. In Its present form the process is entirely 
random, but plant tissues which have been successfully transformed by 
the plasmid DNA (or part of it) can be identified and cultured to 
homogeneity by the use of selectable marker genes (such as the 
5 bacterial neomycin phosphotransferase II gene, NPTII) , or reporter 
genes (such as the bacterial beta- glucuronidase gene, Gus). 
Successful use of particle acceleration to transform plants has 
recently been shown for soybean and the transfer of pl8WMVII-exp into 
the genome could result in obtaining soybean plants which are 
10 resistant to infections from soybean mosaic virus strains. 

The use of this process for the transfer of pl8wMVTI-exp can be 
accomplished after the addition of either plant expressible genes 
NPTII or Gus genes or both. Plasmids that have the nptll and Gus 
genes to pl8WMVII-exp are shown in Chart 11, and referred to as 
15 pl8GWMVII-exp and pl8NGWMVII-exp . In addition, the construction 
described in Example 11 can also be used for microproj ectile transfer 
as it already has both the nptll and Gus genes attached to the 
pWMVII-exp cassette (see Chart 10) . The only difficulty which the 
use of pGA482GG/cpWMVII-exp may encounter during transfer by the 
20 microproj ectile process is due to its large size, about 18kb, which 
may have a lower efficiency transfer and such larger plasmid 
generally yield less DNA during propagation. 

To construct plasmid pl8GWMVii-exp , plasmid pl8WMVii-exp is 
digested with BamHI and ligated with a 3.0 kilobase BamHI isolated 
25 fragment containing the Gus gene. To construct plasmid pl8NGWMVii- 
exp, the plasmid pl8GWMVii-exp is digested with Smal and ligated with 
a 2.4 kb isolated fragment containing the Nos -nptll gene generated by 
digestion with Dral and Stul. 

Example 13 Identification of the ZYMV Coat Protein Gene. 

30 The cloned ZYMV cDNA insert from clone pZYMV-15, which was 

cloned using the method described above, was sequenced by using both 
the chemical (Maxam and Gilbert, Methods of Enzymology 65:499, 1980) 
and enzymatic (Sanger et al., Proc. Natl. Acad. Sci. U.S.A. 74:5463, 
1977) methods. Based on this information and comparative analysis 

35 with other potyviruses the nucleotide sequence of clone pZYMV-15 was 
found to contain a complete copy of the ZYMV coat protein gene. The 
N- terminus of the coat protein was suggested by the location of the 
dipeptide sequence Gln-Ser which is characteristic of cleavage sites 
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in potyviruses (see Dougherty et al. EMBO J. 7:1281, 1988). The 
length of the ZYMV coat protein gene coding region (280 amino acids) 
is consistent with a gene encoding a protein of about 31.3 kD. The 
sequences of this ZYMV coat protein gene and protein are shown in 
5 Chart 3. 

Example 14 Construction of a Plant -expressible ZYMV Coat Protein 
Gene Cassette with CaMV 35S Promoter and Polyadenylation Signal and 
CMV Intergenic Region and Translation Initiator ATG. 

As depicted in Chart 12 , attachment of the necessary plant 

10 regulatory signals to the ZYMV coat protein gene was accomplished by 
using the PCR technique to amplify the ZYMV coat protein gene using 
oligomers which would add the necessary sites to its 5' and 3' 
sequences . Following this amplification the resulting fragment is 
digested with the appropriate restriction enzyme and cloned into the 

15 Ncol site of the above expression cassette containing plasmid, 
pUC18CP-exp. Clones containing the ZYMV coat protein gene insert 
need only be checked to determine correct orientation with respect 
with the CaMV promoter. However, to ensure that no artifacts have 
been incorporated during ■ the PCR amplification the entire coat 

20 protein gene region is checked by nucleotide sequence analysis. 

To obtain the amplified ZYMV coat protein gene with Ncol 
restriction enzyme sites on both ends the following two oligomers 
were synthesized: 

1. 5' - ATCATTCCATGGGCACTCAACCAACTGTGGC-3 ' , which adds a Ncol 
25 site to the 5' -end of the ZYMV coat protein gene and retains the same 

ATG translation start codon which is present in the expression 
cassette, pUC18cpexp. 

2. 5 ' -AGCTAACCATGGCTAAAGATATCAAATAAAGCTG-3 ' , which adds a Ncol 
site to the 3' -end of the ZYMV coat protein gene, this site can be 

30 ligated into the expression cassette, pUC18cpexp. 

The cloning of this PCR ZYMV coat protein gene, using these two 
oligomers, into pUC18cpexp yields a plant expressible ZYMV gene 
(referred to as pUC18cpZYMV) which following transcription and 
translation will generate a ZYMV coat protein which is identical to 

35 that derived from the ZYMV coat protein gene nucleotide sequence, see 
Chart 3. However, this coat protein will differ, because of neces- 
sary genetic engineering to add the ATG initiation codon followed by 
Gly, which is the amino acid 3' adjacent to the Ser of the polyprot- 
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ein cleavage site (see Chart 3) . The Gly amino acid residue was 
selected for the potential N-terminal amino acid because many 
potyvirus coat proteins have either an Ser, Gly, or Ala at their N- 
terminal. However, if this is found to be a problem its replacement 
5 would involve the use of a different oligomer to obtain a different 
N-terminal amino acid for the ZYMV coat protein. The cloned 
construction of the plant expressible ZYMV coat protein gene is 
referred to pUC18cpZYMV, and is shown in Chart 12. 

Example 15 Construction of a Micro T-DNA Plasmid Containing the 

10 Plant- expressible ZYMV Coat Protein Gene Construction. 

Following the teachings of Example 11 with appropriate 
modifications, the construction of a micro T-DNA plasmid containing a 
plant- expressible ZYMV coat protein was constructed. Plasmid 
pUC18cpZYMV (Chart 12) was digested with Hind III (which cuts within 

15 the polycloning sites of pUC18, well outside of the expression 
cassette), and a 1.6 kb fragment containing the plant-expressible 
cassette was removed and ligated into the Hind III site of the micro- 
vector pGA482 (Chart 7). The resulting plasmid was designated, 
pGA482GG/cpZYMV is shown in Chart 13. 

20 After transfer of the engineered plasmid pGA482GG/cpZYMV into 

Agrobacterium strains, the Agrobacterium strains can be used to 
transfer and integrate within a plant genome the plant- expressible 
ZYMV coat protein gene contained within its T-DNA region. 
Example 16 Microprojectile Transfer of pUClScpZYMV into Plant 

25 Tissues . 

Following the teachings of Example 12, the microprojectile 
transfer technique can be used to introduce the ZYMV coat protein 
gene with appropriate genetic regulatory sequences into plant 
tissues . 

30 The use of this process for the transfer of pUC18cpZYMV can be 

accomplished after the addition of either plant expressible genes 
NPTII or Gus genes or both. Plasmids that have the nptll and Gus 
genes to pUClScpZYMV are shown in Chart 14 and referred to as 
P UC18GcpZYMV and pUClSNGcpZYMV . In addition, the construction 

35 described in Example 15 can also be used for microprojectile transfer 
as it already has both the nptll and Gus genes attached to the 
pUC18cpZYMV cassette (see Chart 13). The only difficulty which the 
use of pGA482GG/cpZYMV may encounter during transfer by the 
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microprojectile process is due to its large size, about 18kb, which 
may have a lower efficiency transfer and such larger plasmid 
generally yield less DNA during propagation. 

To construct plasmid pUC18GCPZYMV, plasmid pUC18CPZYMV is 
5 digested with BamHI and ligated to a 3.0 BamHI isolated fragment 
which contains the Gus gene. To construct plasmid pUC18GCPZYMV, 
plasmid pUC18GCPZYMV is digested with Smal and ligated with a 2.4 kb 
isolated fragment containing the Nos nptll gene isolated by digestion 
with Dral and Stul. 
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CHARTS 
CHART 1 



i 

CAGTCCAAGAATCAACCTGTCCATGCTGGTTTGAATGAAAAACTCAAAGAGAAGGAAAAT 

_ ^ + — — — ♦ 

G! nSerLysAsnGluAl «V« I A»pA I »G I yLeuA.nG I uLy »L«uLy »G I uLy «G I uAsn 



CAGAAAGAAAAAGAAAAAGAAAAAC AAAAAGAGAAAGAAAAAGACGGTGCTAGTGACGGA 
. ^ ^ + - ______—- + 

GlnLy»GluLysGluLy»Gl uLy «G lnLy»GluLy*G!uLy»A»pGIyAl»S«rAapG!y 



AATGATGTGTCAACTAGCACAAAAACTGGAGAGAGAGATAGAGATGTCAATGTTGGGACC 

121 ♦ * ♦ * ♦ 

AsnAspV. IS«rThrS«rThrLy»ThrGlyGluArgAspAr fl A« P V«IA»nVi IGlyThr 



♦ 180 



4 240 



AGTGGAACTTTCACTGTTCCGAGAATTAAATCATTTACTGATAAGATGGTTCTACCGAGA 

Igl _ 4. * — — — — — — — — — — ~ — 

S«rGlyThrPheThrV»IProArgI ULysSerPh«ThrA*pLy»M«tV»IL«uProArg 

ATTAAGGGGAAGACTGTCCTTAATTTAAATCATCTTCTTCAGTACAATCCGCAACAAATT 

2 4i + • ♦ ♦ * * ♦ 300 

1 1 eLysG ! yLy aThrVi I LeuAsnLeuAsnH i sL«uLeuG I nTy rAsnProG I nG I nl le 

GACATTTCTAAC ACTCGTGCCACTCATTCAC AATTTGAGAAGTGGTATGAGGGAGTGAGG 

301 + * ♦ ♦ + ♦ 360 

As P II«S«rAsnThrArgAI«ThrHl«SerGlnPh«GluLy«TrpTyrGluGlyVilAr fl 

AATGATTATGGCCTTAATGATAATGAAATGCAAGTGATGCTAAATGGTTTGATtSGTTTGG 

361 « * ♦ * * ♦ 420 

AsnAspTyrG I yLeuAsnAspAsnG I uM»tG 1 nV» I M«tL«uA*nG I yL«uMetV» I Trp 

TGTATCGAGAATGGTACATCTCCAGACATATCTGGTGTCTGGGTTATGATGGATGGGGAA 

421 ♦ ♦ * ♦ * * 480 

CysIUGIuAsnGlyThrSerProAspIleSerGlyV.ITrpVmlMetMetAspGlyGlu 

ACCCAAGTTGATTATCCAATCAAGCCTTTGATTGAGCATGCTACTCCGTCATTTAGGCAA 

481 ♦ + ■— ♦ ♦ ♦ * 540 

ThrG I nVi I AspTy rPrel ULy «ProL»ul I«»G I uH I »A I .ThrProSerPh.ArgG I n 

• ATTATGGCTCACTTTAGTAACGCGGCAGAAGCATACATTGCGAAGAGAAATGCTACTGAG 

54! * . ♦ ♦ ♦ * 600 

ll«M«tAI»HlsPh»S«rA»nAlaAI«GluAl»TyrIl»AI«Ly»ArgA*nAI«ThrGlu 
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CHART 1 (Continued) 



ACCTACATGCCQCGGTATCGAATCAACACAAATTTCACTCACATTACCCTCCCTACATAC 

ee , ♦ - — — * — * * 860 

ArgT r rU«tP roAraTyrClyIleLysArgAsnL.uThrA.pl I «S.r L«u A I • Ar gTy r 

CCTTTCGACTTCTATGAGGTGAATTCGAAAACACCTGATAGGGCTCGCGAAGCTCACATG ^ 
661 AliPhiAspPh«TyrGluV»IA»nS«rLy«ThpProA»pArgAltArgOluAI«H1sMet 



CAGATGAAGGCTGCAGCGCTGCGAAACACCAGTCGCAAAATGTTTGCTATGCACGGCAGT 
721 GlnMetLysAI.AI.AI^LeuArgAsnThrS.rArgLy.U.tPh.GlyMetA 



780 



GTTAGTAACAAGGAAGAAAACACGGAGAGACACACAGTGGAAGATGTCAATAGAGACATG ^ 
781 vriarAsnLysGluGluAsnThrGluArgHJ.Thr 

CACTCTCTCCTGGGTATGCGCAACTAAATACCTGCGCTTGTGTGTTTGTTGAGTCTGACT 

841 --- * ♦ * + 

H 1 sSerLeuLeuG I yWetArgAsntnd 

CGACCCTGTTTCACCTTATCGTACTATATAAGCATTAGAATACACAGTGGC1GCGCCACC 
901 * ♦ ■» * * * 

GCTTCTATTTTACAGTGAGGGTAGCCCTCCGTGCTTTTAGTATTATTCGAGTTCTCTGAG 
961 ♦ * ♦ * * * 

TCTCCATACAGTGTGGGTGGCCCACGTGATATTCGACCCTCTTAGAATGAGAAAAAAAAA ^ 
1021 * "* * * + * 

AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAGCAATTCC 
1081 * * * * 
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CHART 2 

GTGTCTTTACAATCAGGAAAAGAAACAGTTGAAAATTTGGACGCAGGGAAAGAATCAAAG 

! 60 

Va ISerLeuGI nS«rG lyLysG I uThrVa I G I uAsnLeuAspA 1 aG I yLysG I uSerLys 

AAAGATGCCAGTGACAAAGGGAATAAGCCGCAGAACTCGCAAGTTGGTCAGGGTAGCAAG 

61 «, * ♦ + ♦ * 120 

LysAspAlaSerAspLysGlyAsnLysProGlnAsnSerGlnVa IGlyGlnGlySerLys 

GAACCAACAAAAACCGGCACAGTCAGCAAGGATGTAAATGTTGGATCGAAAGGAAAAGAA 

12 i ♦ + * * * * I 80 

GIuProThrLysThrGlyThrVa I SerLy sAspVa I AsnVa I G I ySerLy sG I yLy sG I u 

GTCCCACGACTACAAAAGATAACAAAGAAAATGAATCTTCCAACAGTTGGTGGGAAAATC 

1S1 + * ♦ - ^ 40 

Va I ProArgLeuG I nLy si 1 eThrLysLy sMetAsnLeuProThrVa I G I yG I yLy si I e 

ATTCTTAGCTTAGACCATTTGCTTGAGTACAAACCTAGTCAAGTTGATTTGTTTAACACT 

241 * * * + + 300 

1 1 eLeuSerLeuAspH i sLeuLeuG I uTy rLysProSerG I nVa I AspUsuPSeAsnThr 

CGAGCAACAAAAACACAATTTGAATCATGGTACAGCGCAGTCAAAGTTGAGTATGATCTT 

301 + + ♦ + * 360 

ArgA I aThrLysThrG I nPheG I uSerTrpTy rS«r A I aVa I LysVa I G I uTy r AspLeu 

AATGATGAGCAAATGGGTGTGATTATGAATGGTTTTATGGTTTGGTGTATCGATAACGGT 

361 + + + ■* + ♦ 420 

AsnAspG I uG I nMetG I yVa 1 1 1 ©Met As nG I yPheMetVa I TrpCysI I eAspAsnG I y 

ACATCTCCAGATGTCAATGGAGTGTGGGTAATGATGGATGGGGAAGAGCAAGTTGAGTAC 

421 f + ♦ * ♦ * 480 

ThrSerProAspVa I AsnG I yVa I TrpVa I MetMetAspG I yG I uG I uG I nVa I G I uTy r 

CCACTAAAGCCAATTGTTGAAAATGCAAAGCCAACTTTGAGACAAATCATGCACCATTTC 

481 + * + ♦ ♦ + 540 

ProLeuLysProI I eVa I G I uAsnA I aLy sProThrLeuArgG I nl I ©MetHi sH i sPhe 

TCAGATGCAGCGGAAGCATATATTGAAATGAGAAACTCTGAAAGTCCGTATATGCCTAGA 

541 ♦ * * + * + 600 

SerAspA f aA I aG I uA I aTyrl I «G I uM«tArgAsnSerG I uSerProTy rMetProArg 

TACGGATTACTGAGAAATTTGAGAGACAGGGAATTAGCACGCTATGCTTTTGACTTCTAT 

6 01 + 1_+ ♦ + ♦ ♦ 660 

Ty rGI yLeuLeuArgAsnLeuArgAspArgG I uLeuA I aArgTy rA I aPheAspPheTy r 

GAGGTTACTTCTAAAACGCCAAATAGGGCAAGAGAAGCAATAGCTCAAATGAAGGCCGCG 

661 + + + «• * + 720 

G I uVa I ThrSerLysThrProAsnArgA I aArgG I uA 1 al I eA I aG I nMetLy sA I aA I a 



GCTCTCGCGGGAGTTAACAGCAGGTTATTTGGACTTGATGGTAATATCTCGACCAATTCC 

721 + + + + * + 

A I aLeuA I aG 1 y Va I AsnSerArgLauPhoG I yLeuAspG I yAsn 1 1 eSerTh rAsnSer 



+ 780 
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CHART 2 (Continued) 



GAAAATACTGGGAGGCACACTGCAAGGGACGTAAATCAGAATATGCATACTTTGTTGGGT 
^ + _ + + _____ _ + — — — — ♦ 

G I uAsnThrG I yArgH 1 sThrA I aArgAspVa I AsnG I nAsnMetH 1 sTh rLeuleuG I y 



ATGGGTCCACCGCAGTAAAGACTAGGTAAACTGGTCACAGTTAGCATTTCGGGTCGTTAT 

841 «. * * ♦ + * 900 

MotGlyProProGlnEnd 



AAftTTTTCTATAATATAACATflTCGCACTTTATTTTAGTATAftTGTftiTTTrATrArrTT 
901 - * ♦ — — + 960 

TGTACTGTTTATGTTAGCGTGGTTTAACCACCTTTGTGTGTGCTTTATATTATAGTTTAT 
961 «. — + * ♦ ♦ ♦ 1020 

GCGTAGCAGGGAGAACCATTACAATGCCGGAGTTGTTTGTAGTGTGATTTCATCACGGTT 
1021 * ♦ * ♦ ♦ ♦ 1080 

AATAGCCGAGGTACGGTAATGTTTGTTGCCTAAAAAAAAAAAAAAAAAAAAAAAA 
1081 ♦ — ♦ * * 1135 
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CHART 3 



ATGCTCCAATCAGGCACTCAACCAACTGTGGCACACGCTAGAGTTACAAAGAAAGATAAA 

1 + ♦ ♦ , + QQ 

MetLeuG I n^«rGI yThrG I nProThrVa I A I a Asp A I sArgVi I ThrlysLy sAspLy s 

GAAGATGACAAAGGGGAAAACAAGGATTTCACAGGCTCCGGCTCAGGTGAGAAAACAGTA 

61 * * + + ♦ + 120 

G I uAspAspLysG 1 yG I uAsnLysAspPhaThpG I ySarG I ySerG I yG I uLysThrVa I 

GTAGCTGCCAAGAAAGACAAGGATGTGAATGCTGGTTCTCATGGGAAAATTGTGCCGCGT 
121 * * + _ «. , 180 

Va I A I aA I aLysLy sAspLy sAspVa I AanA ! aG I y S«rH i «G I yLy si I «Va ! ProArg 

CTTTCGAAGATCACAAAGAAAATGTCATTGCCACGCGTGAAAGGGAATGTGATACTCGAT 

181 ♦ ♦ ♦ ♦ ♦ 240 

LeuSarLysI I eThrLy sLy sM«tSerL«uProArgVa I LysG I yAsnVa 1 1 1 eLeuAsp 

ATCGATCATTTGCTGGAATATAAGCCGGATCAAATTGAGTTATACAACACACGAGCGTCT 
241 ♦ * + + ♦ 300 

1 1 «AspH 5 sL«uL«uG I uTy rlysProAspG Inl leGI uL«uTy rAsnThrAr 0 A I «Ser 

CATCAGCAATTTGCCTCTTGGTTCAACCAAGTTAAGACAGAATATGATCTGAATGATCAA 
301 . ♦ ♦ ♦ + «. + 360 

Hi aG I nG I nPheA I »S«rTrpPh«AsnG I nVa I LysThrG I uTyrAspLeuAsnAspG I n 



CAGATGGGAGTTGTGATGAACGGTTTCATGGTTTGGTGTATTGAAAATGGCACCTCACCT 
G I nM«tG I y Va I V. I MetAsnG I yPheMetVa I TrpCy si I eG I uAsnG I yThrSerPro 



GACATTAATGGAGTGTGGTTTATGATGGACGGAAATGAACAAGTTGAGTATCCTTTGAAA 
421 ♦ ♦ ♦ * ♦ + 48 0 

Aspl I •AsnG I y Va I TrpPheM«tM«tA»pG I yAsnG I uG I nV» I G I uTy rProLauLy s 

CCGATAGTTGAAAATGCAAAGCCAACGCTGCGGCAAATAATGCATCATTTTTCAGATGCA 
481 ♦ * ♦ * + + 64 0 

Prol UVa I G I uAsnA I aLysProThrLauArgG I nl laUetH i sH i sPhaSerAspA I a 

GCGGAGGCATATATAGAGATGAGAAATGCAGAGGCACCATACATGCCGAGGTATGGTTTG 
641 * ♦ ♦ + + + 800 

A I aG I uA laTyrl I «G I uMatArgAsnA I aG I uA I aProTy rMatProArgTy pG I yLau 

i 
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CHART 3 (Continued) 



CTTCGAAACCTACGGGATAGGAGTTTAGCACGATACGCTTTCGATTTCTATGAAGTCAAT 

601 + ♦ ♦ + ♦ 660 

LsuArgAsnLeuArgAspArgSerLeuA I »ArgTy rA I «Ph«AspPh«Ty rG I uV« I Asn 

TCTAAAACTCCTGAAAGAGCCCATGAAGCTGTTGCGCAGATGAAAGCAGCAGCTCT7AGC 

861 ♦ ♦ •» ♦ ♦ * 720 

SerLysThrProGluArgAI»HisGluAI»V»IAI»GlnM«tLysAI»AI»AI»LeuS«r 

AATGTTTCTTCAAGTGTGTTTGGCCTTAGTGAAATCGTTGCCACCACTAGCGAAGCCACA 

721 * * * » ♦ ♦ 780 

AsnVa I S«rS«rS«rV» I Ph«G I yL«uSerG lull «V» I A I »ThrThrSerG I uA I »Thr 

CTGAACGGCACACTGCACGTGATGTTAATAGAAACATGCCACACCTTACTAGGTGTGAAT 

781 * * ♦ ♦ * 840 

LeuAsnGlyThrL«uHisV»IMetLeuIleGluThrCysHisThrLeuLeuGlyV»IAsn 

ACAATGCAGTAAAGGGTAGGCCGCCTACCTAGGTTATCGCTTCGCTGCCGACGTAATTCT 

841 ♦ ♦ ♦ ♦ * ♦ 909 

ThrMetGlnEnd 

AATATTTACCAGCTTTATTTGATATCTTTAGATTTCCAGAGTGGGCCTCCCACCTTTAAA 
901 ♦ ♦ + ♦ ♦ ♦ 960 

GCGTAGAGTTTATGCTTAGTTGTCCAGGAGTGCCGTAGTCCTGTCGGAAGCTTTAGTGTG 
961 ♦ ♦ ♦ * ♦ ♦ 1020 

AGCCTCTCACGAATAAGCTCGAGATTACACTCCGTTTGCAAGCCTAAAAAAAAAAAAAAA 
1021 ♦ ♦ * ♦ ♦ * 1080 

AAAA 

1081 1084 
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CHART 4 



PRV-d CAGTOCAAGAATGAAGCTGTGGATGCTGCTTTGAATGAAAAACTCAAAGA 50 

! | I | I | I | | | | | I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I 

PRV-w CAGTCCAAAAATGAAGCTGTGGATACTGGTTTGAATGAAAAATTCAAAGA 50 

GAAGGAAAATCAGAAAGAAAAAGAAAAAGAAAAACAAAAAGAGAAAGAAA 100 

MINIM IIIIIIIIHIIilllllllllltllllllllllllllll 

aaaggaaaaacagaaagUUagaaaaagaaaaacaaaaagagaaagaaa 100 

AAGACGGTGCTAGTGACGGAAATGATGTGTCAACTAGCACAAAAACTGGA 150 

GAGAGAGATAGAGATGTCAATGTTGGGACCAGTGGAACTTTCACTGTTCC 200 



250 
250 



GAGAATTAAATCATTTACTGATAAGATGGTTCTACCGAGAATTAAGGGGA 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I III II UN I 

gagaUtaUtcatttactgataagatgattctaccgagaattaagggaa 

AGACTGTCCTTAATTTAAATCATCTTCTTCAGTACAATCCGCAACAAATT 300 

GACATTTCTAACACTCGTGCCACTCATTCACAATTTGAGAAGTGGTATGA 350 

GGGAGTGAGGAATGATTATGGCCTTAATGATAATGAAATGCAAGTGATGC 400 

TAAATGGTTTGATGGTTTGGTGTATCGAGAATGGTACATCTCCAGACATA 450 
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CHART 4 (Continued) 



TCTGGTGTCTGGGTTATCATGGATGGGGAAACCCAAGTTCATTATCCAAT 500 

I | I I I I I I I I I I I i I I I I I I I I 'I I I I I I I I I I I I I I I I I I I I i I M I I M 

TCTGGTGTCTGGGTTATGATGGATGGGGAAACCCAAGTTGATTATCCAAT 600 

CAAGCCTTTGATTGAGCATGCTACTCCGTCATTTAGGCAAATTATGGCTC 550 

I! I Mil II llllllllllllllllllllllllllllllllllllllll 

CAAGCCTTTAATTGAGCATGCTACTCCGTCATTTAGGCAAATTATGGCTC 550 

ACTTTAGTAACGCGGCAGAAGCATACATTGCGAAGAGAAATGCTACTGAG 600 

llllllllllilllllllllllllllllllllll lllllllllllllll o 

ACTTTAGTAACGCGGCAGAAGCATACATTGCGAAAAGAAATGCTACTGAG 600 

AGGTACATGCCGCGGTATGGAATCAAGAGAAATTTGACTGACATTAGCCT 650 

I I I I I I I I I I I I I I I I I I I I I I I I i M I I I I i I I I I I I I I I I I I I I I I I I 

AGGTACATGCCGCGGTATGGAATCAAGAGAAATTTGACTGACATTAGCCT 650 

CGCTAGATACGCTTTCGACTTCTATGAGGTGAATTCGAAAACACCTGATA 700 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

CGCTAGATACGCTTTCGACTTCTATGAGGTGAATTCGAAAACACCTGATA 700 

GGGCTCGCGAAGCTCACATGCAGATGAAGGCTGCAGCGCTGCGAAACACC 750 

AGTCGCAAAATGTTTGGTATGGACGGCAGTGTTAGTAACAAGGAAGAAAA 800 

CACGGAGAGACACACAGTGGAAGATGTCAATAGAGACATGCACTCTCTCC 850 

TGGGTATGCGCAACTAA 867 

lilllllllllllllll 
TGGGTATGCGCAACTAA 867 
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CHART 5 



SMV SGKEKEGDWDAOKDPKKSTSSSKG AGTSSKDVNV 34 

Mil 1 I I I I I I t I I I I I I I I I I I 

WMVII SGKETVENLDAGKESKKDASDKGNKPQNSQVGQGSKEPTKTGTVSKDVNV 50 

SMV GSKGKVVPRLQKITRKMNLPMVEGKIILSLOHLLEYKPNQVOLFNTRATR 84 

smv Yin i i r ii 1 1 1 1 1 u n i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 milium 

WMVII GSKGKEVPRLQKITKKMNLPTVGGKI «• 

SMV TQFEAWYNAVKDEYELDDEQMGVVMNGFMVWCIDNGTSPDANGYWVMMDG 134 

wmvii eeqvevplkpivenakMlrqimhhfsoaaeayiemrnsespymprygll see 

SMV RNLRDRELARYAFDFYEVTSKTPNRAREAIAQMKAAALSGVNNKLFGLDG 234 

WMVII ^iii^iWipipUvlsilpiUpiiUiiiliiiAGWsRipGiiG 2 S, 

SMV NISTNSENTERHTARDVNQNMHTLLGMGPPQ 265 
I I I 1 1 I I III III I III III Ml III I i I I I 

WMVII NISTNSENTGRHTARDVNQNMHTLLGMGPPQ 281 
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pPRV117 







* 




* 


* 


10 




PRV-p Coat Protein Gene 





15 



20 



25 



pTJC1813/Cpl9 



Hindlll 



Ncol Tthllll 



BstXI Ncol 



1 






4. 

LJJ 


1 


1. 1 

I L 




4 




r ca 






CMV Coat Protein Gene 




s ca 





30 P UC1813/Cpl9-PRVexp 



Hindlll 



35 



Hindlll 



Ncol Tthllll 



BstXI Ncol 



Hindlll 



4- 

_ u 




4- 

1 U 


■ 


!• 


1 I 
1 




4 

I J_ 




p ca 




CMV 


PRV-p Coat 




s ca 





(16AA) 



gene 



WO 90/02184 



PCT/US89/03094 



-36- 
CHART 7 
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pWMVII-41-3.2 



* 




* 


10 


WMVII Goat Protein 





Gene 



PCR Generated Gene 



Ncol 



Ncol 
4- 



WMVII Coat Protein Gene 



30 



35 



pl8WMVII-exp 

Hindlll 
* 



Ncol 



Ncol Hindlll BamHI Smal 



4- 




■ 


V 


■ 




4-4-4. 
J I 1— 




p 

•^ca 




WMV Coat Protein 




s ca 





Gene 



WO 90/02184 



PCT/US89/03094 



-39- 
CHART 10 
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Hindi I I 



Ncol 



Ncol Hindlll 
4- l 



10 



Nos 



ca 



WMVII Coat 
Protein 
Gene 



CaMV 

Gus 

Gene 



15 



WO 90/02184 



PCT/US89/03094 



-40- 
CHART 11 
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CIAIMS 

1. A recombinant DNA molecule which encodes a potyvirus coat 
protein, said recombinant DNA molecule selected from the group 
consisting of Papaya ringspot virus strain papaya ringspot PRV-p coat 

5 protein gene, Watermelon mosaic virus II WMVII coat protein gene, and 
Zucchini yellow mosaic virus ZYMV coat protein gene. 

2. A recombinant DNA molecule according to Claim 1 wherein said 
recombinant DNA molecule encodes Papaya ringspot virus strain papaya 

10 ringspot PRV-p coat protein, said recombinant DNA molecule having the 
nucleotide sequence shown in Chart 1. 

3. A recombinant DNA molecule according to Claim 1 wherein said 
recombinant DNA molecule encodes Watermelon mosaic virus ,11 WMVII 

15 coat protein, said recombinant DNA molecule having the nucleotide 
sequence shown in Chart 2. 

4. A recombinant DNA molecule according to Claim 1 wherein said 
recombinant DNA molecule encodes Zucchini yellow mosaic virus ZYMV 

20 coat protein, said recombinant DNA molecule having the nucleotide 
sequence shown in Chart 3. 

5. A recombinant DNA molecule according to Claim 1 further 
comprising: 

25 a) a promoter; 

b) an initiation region; and, 

c) a poly (A) addition signal; 

wherein said promoter is upstream and operably linked to said 
Initiation region, said initiation region is upstream and operably 
30 linked to said recombinant DNA molecule encoding a coat protein, and 
said recombinant DNA molecule encoding a coat protein is upstream and 
operably linked to said poly (A) addition signal. 

6. A recombinant DNA molecule according to Claim 5 wherein said 
35 promoter is Cauliflower mosaic virus CatfV 35S promoter. 

7. A recombinant DNA molecule according to Claim 5 wherein said 
initiation region is selected from the group consisting of an 
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5 



30 



35 



initiation region derived from the 5' untranslated region of Cucumber 
mosaic virus CMV coat protein gene and an initiation region derived 
from the 5' untranslated region of SS RUBISCO gene. 



8. A recombinant DNA molecule according to Claim 5 wherein said 
initiation region comprises the sequence AAXXATGG. 



9. A recombinant DNA molecule according to Claim 5 wherein said 
poly(A) addition signal is selected from the group consisting of: the 

10 poly (A) signal derived from Cauliflower mosaic virus CaMV 35S gene; 
the poly (A) signal derived from phaseolin storage protein gene; the 
poly (A) signal derived from nopaline synthase gene; the poly (A) 
signal derived from octopine synthase gene; the poly (A) signal 
derived from bean storage protein gene; and, the poly (A) signal 

15 derived from SS RUBISCO. 

10. A recombinant DNA molecule according to Claim 6 wherein said 
initiation region is derived from the 5' untranslated region of 
Cucumber mosaic virus CMV coat protein gene and said poly (A) addition 

20 signal is derived from Cauliflower mosaic virus CaMV 35S gene. 

11. A recombinant DNA molecule according to Claim 5 further 
comprising an AT rich 5' untranslated region wherein: 

a) said AT rich region is downstream from said promoter and 
25 upstream from said initiation region; 

b) said initiation region comprises the sequence AAXXATGG; 

and, 

c) said poly(A) addition signal contains untranslated 
flanking sequences. 



12. A recombinant DNA molecule according to Claim 14 wherein said 
promoter is Cauliflower mosaic virus CaMV 35S promoter. 

13. A recombinant DNA molecule according to Claim 11 wherein said 
AT-rich 5' untranslated region is derived from the 5' untranslated 
region of a gene selected from the group consisting of Cucumber 
mosaic virus CMV coat protein gene and SS RUBISCO gene. 
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14. A recombinant DNA molecule according to Claim 11 wherein said 
initiation region is derived from the 5' untranslated region of a 
gene selected from the group consisting of Cucumber mosaic virus CMV 
coat protein gene and SS RUBISCO gene. 

5 

15 . A recombinant DNA molecule according to Claim 11 wherein said 
initiation region comprises the sequence AAXXATGG. 

16. A recombinant DNA molecule according to Claim 11 wherein said 
10 poly (A) addition signal is selected from the group consisting of: the 

poly (A) signal derived from Cauliflower mosaic virus GaMV 35S gene; 
the poly (A) signal derived from phaseolin storage protein gene; the 
poly (A) signal derived from nopalinesynthase gene; the poly(A) signal 
derived from octopine synthase gene; the poly (A) signal derived from 
15 bean storage protein gene; and, the poly(A) signal derived from SS 
RUBISCO . 

17. A recombinant DNA molecule according to Claim 12 wherein; 

a) said AT rich 5' untranslated region and said initiation 
20 region are derived from the 5' untranslated region of Cucumber mosaic 

virus CMV coat protein gene; 

b) said initiation region comprises the sequence AAXXATGG; 

and, 

c) and said poly(A) addition signal is derived from Cauli- 
25 flower mosaic virus CaMV 35S gene. 

18 . A transformed plant cell containing a recombinant DNA molecule 
according to Claim 5 . 

30 19. A transformed plant cell according to Claim 18 containing a 
recombinant DNA molecule according to Claim 10. 

20. A transformed plant cell according to Claim 18 containing a 
recombinant DNA molecule according to Claim 11. 

35 

21. A transformed plant cell according to Claim 20 containing a 
recombinant DNA molecule according to Claim 17. 
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22. A transgenic plant selected from the group consisting of the 
families Cucuribitaceae , Caricaceae, Solanaceae, and Leguminoseae 
comprising transformed plant cells according to Claim 18. 

5 23. A transgenic plant according to Claim 22 comprising transformed 
plant cells according to Claim 19 . 

24. A transgenic plant according to Claim 22 comprising transformed 
plant cells according to Claim 20. 

10 

25. A transgenic plant according to Claim 24 comprising transformed 
plant cells according to Claim 21. 

26. A process for producing a transgenic plant which is resistant to 
15 viral infection comprising the steps of: 

.a) constructing a recombinant DNA molecule according to 

Claim 5; 

b) transforming plant cells with said recombinant DNA; and 

c) regenerating plants from said transformed plant cells. 
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